Fast Loosely-Timed Deep Neural Network Models with Accurate Memory Contention
نویسندگان
چکیده
The emergence of data-intensive applications, such as Deep Neural Networks (DNN), exacerbates the well-known memory bottleneck in computer systems and demands early attention design flow. Electronic System-Level (ESL) using SystemC Transaction Level Modeling (TLM) enables effective performance estimation, space exploration, gradual refinement. However, contention is often only detectable after detailed TLM-2.0 approximately-timed or cycle-accurate RTL models are developed. A detected at a late stage can severely limit available choices even require costly redesign. In this work, we propose novel loosely-timed contention-aware (LT-CA) modeling style that offers high-speed simulation close to traditional (LT) models, yet shows same accuracy for low-level (AT) models. Thus, our proposed LT-CA breaks speed/accuracy tradeoff between regular LT AT fast accurate observation visualization contention. Our extensible model generator automatically produces desired TLM-1 from DNN architecture description exploration focusing on We demonstrate approach with real-world industry-strength application, GoogLeNet. experimental results show 46x faster than equivalent an average error less 1% simulated time. Early detection contentions also suggests local memories computing cores eliminate applications.
منابع مشابه
Deep Neural Network Language Models
In recent years, neural network language models (NNLMs) have shown success in both peplexity and word error rate (WER) compared to conventional n-gram language models. Most NNLMs are trained with one hidden layer. Deep neural networks (DNNs) with more hidden layers have been shown to capture higher-level discriminative information about input features, and thus produce better networks. Motivate...
متن کاملFast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network
We propose a highly efficient and faster Single Image Super-Resolution (SISR) model with Deep Convolutional neural networks (Deep CNN). Deep CNN have recently shown that they have a significant reconstruction performance on single-image super-resolution. Current trend is using deeper CNN layers to improve performance. However, deep models demand larger computation resources and is not suitable ...
متن کاملMultifeature Modular Deep Neural Network Acoustic Models
This paper presents and examines multifeature modular deep neural network acoustic models. The proposed setup uses well trained bottleneck networks to extract features from multiple combinations of input features and combines them using a classification deep neural network (DNN). The effectiveness of each feature combination is evaluated empirically on multiple test sets for both a classical DN...
متن کاملFast and accurate recurrent neural network acoustic models for speech recognition
We have recently shown that deep Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) outperform feed forward deep neural networks (DNNs) as acoustic models for speech recognition. More recently, we have shown that the performance of sequence trained context dependent (CD) hidden Markov model (HMM) acoustic models using such LSTM RNNs can be equaled by sequence trained phone models in...
متن کاملFast and Accurate Inference with Adaptive Ensemble Prediction in Image Classification with Deep Neural Networks
Ensembling multiple predictions is a widely used technique to improve the accuracy of various machine learning tasks. In image classification tasks, for example, averaging the predictions for multiple patches extracted from the input image significantly improves accuracy. Using multiple networks trained independently to make predictions improves accuracy further. One obvious drawback of the ens...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions in Embedded Computing Systems
سال: 2023
ISSN: ['1539-9087', '1558-3465']
DOI: https://doi.org/10.1145/3607548